Learning Visual Localization of a Quadrotor Using Its Noise as Self-Supervision
نویسندگان
چکیده
We introduce an approach to train neural network models for visual object localization using a small training set, labeled with ground truth positions and large unlabeled one. assume that the be localized emits sound, which is perceived by microphone rigidly affixed camera. This information used as target of cross-modal pretext task: predicting sound features from camera frames. By solving task, model draws self-supervision audio data. The well suited robot learning: we instantiate it localize quadrotor 128 × 80 pixel images acquired robot. Experiments on separate testing set show introducing auxiliary task yields performance improvements: Mean Absolute Error (MAE) estimated image coordinates reduced 7 4 pixels; MAE distance 28 cm 14 cm. A has access labels entire 2 pixels 11 cm, respectively.
منابع مشابه
the effect of using learning logs as a self-assessment tool on the syntactic development among iranian pre-intermediate efl learners
چکیده: هدف از انجام این پژوهش ،بررسی استفاده از یادداشتهای یادگیری به عنوان یکی از ابزار های خود ارزشیابی بر افزایش مهارت دستوری (صورتهای شرطی و مجهول) زبان آموزان در مقطع پیش متوسطه بوده است .بدین منظور یک تست استاندارد در مقطع پیش متوسطه بین 90 نفر زبان آموز در سنین 15 تا 20 سال در آموزشگاه زبان انگلیسی امین و پارسا در شهرستان شاهرود برگزار شد از میان این افراد 60 فراگیر انتخاب شدند. و این 6...
learning style, self efficacy and intrinsic motivation as predictors of iranian ielts reading comprehension
this thesis attempts to measure learning styles, self efficacy and intrinsic motivation as predictors of iranian ielts reading comprehension. in order to address this issue, a quantitative study was conducted on some randomly selected intact students at ferdowsi university. these two groups were assigned as they were undergraduate (ba=91) and graduate (ma =74) students; they were all aged betwe...
Learning Unsupervised Visual Grounding Through Semantic Self-Supervision
Localizing natural language phrases in images is a challenging problem that requires joint understanding of both the textual and visual modalities. In the unsupervised setting, lack of supervisory signals exacerbate this difficulty. In this paper, we propose a novel framework for unsupervised visual grounding which uses concept learning as a proxy task to obtain self-supervision. The simple int...
متن کاملLoss is its own Reward: Self-Supervision for Reinforcement Learning
Reinforcement learning optimizes policies for expected cumulative reward. Need the supervision be so narrow? Reward is delayed and sparse for many tasks, making it a difficult and impoverished signal for end-to-end optimization. To augment reward, we consider a range of selfsupervised tasks that incorporate states, actions, and successors to provide auxiliary losses. These losses offer ubiquito...
متن کاملSelf-Supervision for Reinforcement Learning
Reinforcement learning optimizes policies for expected cumulative reward. Need the supervision be so narrow? Reward is delayed and sparse for many tasks, making it a difficult and impoverished signal for end-to-end optimization. To augment reward, we consider a range of self-supervised tasks that incorporate states, actions, and successors to provide auxiliary losses. These losses offer ubiquit...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE robotics and automation letters
سال: 2022
ISSN: ['2377-3766']
DOI: https://doi.org/10.1109/lra.2022.3143565